A Multi-Resolution Approach to GAN-Based Speech Enhancement
نویسندگان
چکیده
Recently, generative adversarial networks (GANs) have been successfully applied to speech enhancement. However, there still remain two issues that need be addressed: (1) GAN-based training is typically unstable due its non-convex property, and (2) most of the conventional methods do not fully take advantage characteristics, which could result in a sub-optimal solution. In order deal with these problems, we propose progressive generator can handle multi-resolution fashion. Additionally, multi-scale discriminator discriminates real generated at various sampling rates stabilize GAN training. The proposed structure was compared enhancement algorithms using VoiceBank-DEMAND dataset. Experimental results showed approach make faster more stable, improves performance on metrics for
منابع مشابه
A Multi-Microphone Post-Filtering Approach for Speech Enhancement
Multi-microphone post-filtering allows additional noise reduction at a beamformer output. Existing techniques are either restricted to classical delay-andsum beamformers, or are based on single-channel speech enhancement algorithms that are inefficient at attenuating transient noise. In this paper, we introduce a multimicrophone post-filtering approach, applicable to adaptive beamformer, that d...
متن کاملA multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
In this paper a multi-channel speech enhancement framework for distant speech acquisition in noisy and reverberant environments for Non-negative Matrix Factorization (NMF)-based Automatic Speech Recognition (ASR) is proposed. The system is evaluated for its use in an assistive vocal interface for physically impaired and speech-impaired users. The framework utilises the Spatially Pre-processed S...
متن کاملA perceptual kalman filtering-based approach for speech enhancement
A new approach for single channel speech enhancement based on Kalman filtering and masking properties of the human auditory system is proposed in the paper. A standard time-varying Kalman filtering method is extended by combining the calculation of noise masking thresholds during the process of parameter updating. Simulation results of a traditional spectral subtraction method, an extended spec...
متن کاملA Chance Constraint Approach to Multi Response Optimization Based on a Network Data Envelopment Analysis
In this paper, a novel approach for multi response optimization is presented. In the proposed approach, response variables in treatments combination occur with a certain probability. Moreover, we assume that each treatment has a network style. Because of the probabilistic nature of treatment combination, the proposed approach can compute the efficiency of each treatment under the desirable reli...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app11020721